Co-Allocation Based Scheduling For Parallel Systems

نویسندگان

  • Seren Soner
  • Can Özturan
  • Oğuz Tosun
چکیده

State-of-the-art supercomputers are made up of multiple types of resources. User jobs also have wide spectrum of resource requirements. Hence, a supercomputer can be thought of as a collection of heterogeneous resources with heterogeneous usage requirements from the users. Schedulers for such systems are challenged by several issues like scalability, GPU, topology and energy awareness. We view each scheduling step as solving a coallocation problem, i.e. the problem of allocating multiple resources simultaneously to jobs. Collection of jobs can be repeatedly taken from the front of the job queue (i.e. a window of jobs) and a co-allocation problem formulated as an (integer) linear program (ILP/LP) can be solved. ILP formulations for single-type and multiple instances, a CPU-GPU and generalized systems are provided. Co-allocation solver is applied to both the window of jobs and the backfilled jobs. Simulation results show effectiveness of our approaches when compared with pure first-come-firstserved schedulers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

OASS: A Mixed-Integer Programming Scheduling Model for Ordering and Allocating Parallel Jobs on Multi-Cluster Systems

Multi-cluster environments are composed of multiple clusters of computers that act collaboratively, thus allowing computational problems that require more resources than those available in a single cluster to be treated. However, the degree of complexity of the scheduling process is greatly increased by the heterogeneity of resources and the co-allocation process, which distributes the tasks of...

متن کامل

Goal programming-based post-disaster decision making for allocation and scheduling the rescue units in natural disaster with time-window

Natural disasters, such as earthquakes, tsunamis, and hurricanes cause enormous harm during each year. To reduce casualties and economic losses in the response phase, rescue units must be allocated and scheduled efficiently, such that it is a key issues in emergency response. In this paper, a multi-objective mix integer nonlinear programming model (MOMINLP) is proposed to minimize sum of weight...

متن کامل

Developing a method for reliability allocation of series-parallel systems by considering common cause failure

Reliability allocation has an essential connection to design for reliability and is an important activity in the product design and development process. In determining the reliability of subsystems or components on the basis of goal reliability, attention must be paid to failure effect, failure information, and improvement opportunities based upon real potentials for reliability improvement. In...

متن کامل

Genetic-based Approach for Scheduling, Allocation, and Mapping for HW/SW Co-Design with Dynamic Allocation Threshold

We develop a genetic-based approach for system-level architecture synthesis for scheduling, allocation, and mapping. Traditional design practices lead to over allocating resources for embedded SoC and, hence, high manufacture cost. The approach determines the scheduling, allocation, and mapping for the system at every step to find the near optimal solution. Unlike other genetic approach for par...

متن کامل

Green Energy-aware task scheduling using the DVFS technique in Cloud Computing

Nowdays, energy consumption as a critical issue in distributed computing systems with high performance has become so green computing tries to energy consumption, carbon footprint and CO2 emissions in high performance computing systems (HPCs) such as clusters, Grid and Cloud that a large number of parallel. Reducing energy consumption for high end computing can bring various benefits such as red...

متن کامل

Static Task Allocation in Distributed Systems Using Parallel Genetic Algorithm

Over the past two decades, PC speeds have increased from a few instructions per second to several million instructions per second. The tremendous speed of today's networks as well as the increasing need for high-performance systems has made researchers interested in parallel and distributed computing. The rapid growth of distributed systems has led to a variety of problems. Task allocation is a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011